OCR exploration server

Warning: this site is under development.
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Learning the Logic of Simple Phonotactics

Internal identifier: 001C36 (Main/Exploration); previous: 001C35; next: 001C37

Authors: F. Tjong Kim Sang [Belgium]; John Nerbonne [Netherlands]

RBID : ISTEX:70C7D08C9A5C9D7C6CC886ADF30144BD359087A2

Abstract

We report on experiments which demonstrate that by abductive inference it is possible to learn enough simple phonotactics to distinguish words from non-words for a simplified set of Dutch, the monosyllables. The monosyllables are distinguished in input so that segmentation is not problematic. Frequency information is withheld as is negative data. The methods are all tested using ten-fold cross-validation as well as a fixed number of randomly generated strings. Orthographic and phonetic representations are compared. The work presented in this chapter is part of a larger project comparing different machine learning techniques on linguistic data.

DOI: 10.1007/3-540-40030-3_7



The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct:series">
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Learning the Logic of Simple Phonotactics</title>
<author>
<name sortKey="Tjong Kim Sang, F" sort="Tjong Kim Sang, F" uniqKey="Tjong Kim Sang F" first="F." last="Tjong Kim Sang">F. Tjong Kim Sang</name>
</author>
<author>
<name sortKey="Nerbonne, John" sort="Nerbonne, John" uniqKey="Nerbonne J" first="John" last="Nerbonne">John Nerbonne</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:70C7D08C9A5C9D7C6CC886ADF30144BD359087A2</idno>
<date when="2000" year="2000">2000</date>
<idno type="doi">10.1007/3-540-40030-3_7</idno>
<idno type="url">https://api.istex.fr/document/70C7D08C9A5C9D7C6CC886ADF30144BD359087A2/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">001E31</idno>
<idno type="wicri:Area/Istex/Curation">001D08</idno>
<idno type="wicri:Area/Istex/Checkpoint">001238</idno>
<idno type="wicri:doubleKey">0302-9743:2000:Tjong Kim Sang F:learning:the:logic</idno>
<idno type="wicri:Area/Main/Merge">001D35</idno>
<idno type="wicri:Area/Main/Curation">001C36</idno>
<idno type="wicri:Area/Main/Exploration">001C36</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a" type="main" xml:lang="en">Learning the Logic of Simple Phonotactics</title>
<author>
<name sortKey="Tjong Kim Sang, F" sort="Tjong Kim Sang, F" uniqKey="Tjong Kim Sang F" first="F." last="Tjong Kim Sang">F. Tjong Kim Sang</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Belgique</country>
<wicri:regionArea>CNTS - Language Technology Group, University of Antwerp</wicri:regionArea>
<placeName>
<settlement type="city">Anvers</settlement>
<region type="district" nuts="2">Province d'Anvers</region>
</placeName>
<orgName type="university">Université d'Anvers</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Belgique</country>
</affiliation>
</author>
<author>
<name sortKey="Nerbonne, John" sort="Nerbonne, John" uniqKey="Nerbonne J" first="John" last="Nerbonne">John Nerbonne</name>
<affiliation wicri:level="4">
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Alfa-informatica, BCN, University of Groningen</wicri:regionArea>
<placeName>
<settlement type="city">Groningue (ville)</settlement>
<region>Groningue (province)</region>
</placeName>
<orgName type="university">Université de Groningue</orgName>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Pays-Bas</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="s">Lecture Notes in Computer Science</title>
<imprint>
<date>2000</date>
</imprint>
<idno type="ISSN">0302-9743</idno>
</series>
<idno type="istex">70C7D08C9A5C9D7C6CC886ADF30144BD359087A2</idno>
<idno type="DOI">10.1007/3-540-40030-3_7</idno>
<idno type="ChapterID">7</idno>
<idno type="ChapterID">Chap7</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0302-9743</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass></textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Abstract: We report on experiments which demonstrate that by abductive inference it is possible to learn enough simple phonotactics to distinguish words from non-words for a simplified set of Dutch, the monosyllables. The monosyllables are distinguished in input so that segmentation is not problematic. Frequency information is withheld as is negative data. The methods are all tested using ten-fold cross-validation as well as a fixed number of randomly generated strings. Orthographic and phonetic representations are compared. The work presented in this chapter is part of a larger project comparing different machine learning techniques on linguistic data.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Belgique</li>
<li>Pays-Bas</li>
</country>
<region>
<li>Groningue (province)</li>
<li>Province d'Anvers</li>
</region>
<settlement>
<li>Anvers</li>
<li>Groningue (ville)</li>
</settlement>
<orgName>
<li>Université d'Anvers</li>
<li>Université de Groningue</li>
</orgName>
</list>
<tree>
<country name="Belgique">
<region name="Province d'Anvers">
<name sortKey="Tjong Kim Sang, F" sort="Tjong Kim Sang, F" uniqKey="Tjong Kim Sang F" first="F." last="Tjong Kim Sang">F. Tjong Kim Sang</name>
</region>
<name sortKey="Tjong Kim Sang, F" sort="Tjong Kim Sang, F" uniqKey="Tjong Kim Sang F" first="F." last="Tjong Kim Sang">F. Tjong Kim Sang</name>
</country>
<country name="Pays-Bas">
<region name="Groningue (province)">
<name sortKey="Nerbonne, John" sort="Nerbonne, John" uniqKey="Nerbonne J" first="John" last="Nerbonne">John Nerbonne</name>
</region>
<name sortKey="Nerbonne, John" sort="Nerbonne, John" uniqKey="Nerbonne J" first="John" last="Nerbonne">John Nerbonne</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001C36 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001C36 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:70C7D08C9A5C9D7C6CC886ADF30144BD359087A2
   |texte=   Learning the Logic of Simple Phonotactics
}}

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024